Word recognition through mapping of lip movements from speech utterance using audiovisual fusion and MLP

Abstract

Speech carries more information than text, but in noisy environments it is often not decoded properly by humans, and the same is true of machines. Because speech is bimodal, augmenting the audio features with visual features, specifically those related to lip movements, can improve the degree of recognition. The objective of this work is to use lip movements as an aid to word recognition. MFCC features extracted from the audio and geometrical features of the lip movements are used together in a machine learning algorithm to predict word utterances. Videos of the utterances are taken from the TIMID database. Together with the corresponding statistical features, these form the input feature vector for an MLP (multi-layer perceptron). Experimental results show that the MLP obtained an accuracy of 91%, while a KNN classifier attained 61%. The work presented here has important implications for HMI communication applications and helps the hearing impaired.
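The feature-level fusion described in the abstract can be illustrated with a minimal sketch. The abstract does not say which geometrical lip features or which statistics were used, so everything named here is an assumption made for illustration: four lip landmarks per frame (`left`, `right`, `top`, `bottom`), three geometric measures (width, height, height-to-width ratio), and the per-feature mean and standard deviation over frames. The MFCC frames are assumed to be precomputed by some audio front end, and the classifier (MLP or KNN) that would consume the fused vector is not shown.

```python
from math import dist
from statistics import mean, pstdev


def lip_geometry(landmarks):
    """Geometric features for one video frame.

    `landmarks` is a hypothetical dict with four (x, y) lip points:
    'left' and 'right' (mouth corners), 'top' and 'bottom' (lip midpoints).
    """
    width = dist(landmarks["left"], landmarks["right"])
    height = dist(landmarks["top"], landmarks["bottom"])
    ratio = height / width if width else 0.0
    return [width, height, ratio]


def column_stats(frames):
    """Mean and population standard deviation of each column over all frames."""
    columns = list(zip(*frames))
    return [mean(c) for c in columns] + [pstdev(c) for c in columns]


def fuse(mfcc_frames, lip_frames):
    """Feature-level fusion: concatenate audio statistics and visual statistics."""
    visual_frames = [lip_geometry(f) for f in lip_frames]
    return column_stats(mfcc_frames) + column_stats(visual_frames)
```

With 13 MFCC coefficients per audio frame, `fuse` would return a vector of 26 audio values followed by 6 visual values for each utterance, regardless of how many frames the utterance contains. That fixed length is what lets utterances of different durations feed the same classifier input layer.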

Similar resources

Lip movements affect infants' audiovisual speech perception.

Speech is robustly audiovisual from early in infancy. Here we show that audiovisual speech perception in 4.5-month-old infants is influenced by sensorimotor information related to the lip movements they make while chewing or sucking. Experiment 1 consisted of a classic audiovisual matching procedure, in which two simultaneously displayed talking faces (visual [i] and [u]) were presented with a ...

Thai Word Recognition Using Hybrid MLP-HMM

The Hidden Markov Model (HMM) is a popular model for speech recognition systems. However, one of the difficulties in applying HMM is the estimation of the emission probabilities for constructing the Gaussian Mixture Models (GMMs). In this paper, we propose a method to estimate the state emission probabilities in HMM framework using Artificial Neural Networks (ANNs), particularly the Multi-Layer...

Whole-Word Recognition from Articulatory Movements for Silent Speech Interfaces

Articulation-based silent speech interfaces convert silently produced speech movements into audible words. These systems are still in their experimental stages, but have significant potential for facilitating oral communication in persons with laryngectomy or speech impairments. In this paper, we report the result of a novel, real-time algorithm that recognizes whole words based on articulatory ...

Fuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition

In this paper, the use of clustering algorithms for data fusion at the decision level is proposed. The results of automatic isolated word recognition, derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...

Journal

Journal title: International Journal of Health Sciences (IJHS)

Year: 2022

ISSN: 2550-6978, 2550-696X

DOI: https://doi.org/10.53730/ijhs.v6ns2.6078